NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Improving Out-of-Vocabulary Handling in Recommendation Systems

Shiao, William; Ju, Mingxuan; Guo, Zhichun; Chen, Xin; Papalexakis, Evangelos E; Zhao, Tong; Shah, Neil; Liu, Yozen (April 2025, Resource Efficient Learning Workshop at The Web Conference (TheWebConf) (2025))

Free, publicly-accessible full text available April 29, 2026
Pure Message Passing Can Estimate Common Neighbor for Link Prediction

Dong, Kaiwen; Guo, Zhichun; Chawla, Nitesh V (December 2024, NeuralPS)

Message Passing Neural Networks (MPNNs) have emerged as the de facto standard in graph representation learning. However, when it comes to link prediction, they are not always superior to simple heuristics such as Common Neighbor (CN). This discrepancy stems from a fundamental limitation: while MPNNs excel in node-level representation, they stumble with encoding the joint structural features essential to link prediction, like CN. To bridge this gap, we posit that, by harnessing the orthogonality of input vectors, pure message-passing can indeed capture joint structural features. Specifically, we study the proficiency of MPNNs in approximating CN heuristics. Based on our findings, we introduce the Message Passing Link Predictor (MPLP), a novel link prediction model. MPLP taps into quasiorthogonal vectors to estimate link-level structural features, all while preserving the node-level complexities. We conduct experiments on benchmark datasets from various domains, where our method consistently outperforms the baseline methods, establishing new state-of-the-arts.
more » « less
Full Text Available
Boosting Graph Neural Networks via Adaptive Knowledge Distillation

https://doi.org/10.1609/aaai.v37i6.25944

Guo, Zhichun; Zhang, Chunhui; Fan, Yujie; Tian, Yijun; Zhang, Chuxu; Chawla, Nitesh V. (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Graph neural networks (GNNs) have shown remarkable performance on diverse graph mining tasks. While sharing the same message passing framework, our study shows that different GNNs learn distinct knowledge from the same graph. This implies potential performance improvement by distilling the complementary knowledge from multiple models. However, knowledge distillation (KD) transfers knowledge from high-capacity teachers to a lightweight student, which deviates from our scenario: GNNs are often shallow. To transfer knowledge effectively, we need to tackle two challenges: how to transfer knowledge from compact teachers to a student with the same capacity; and, how to exploit student GNN's own learning ability. In this paper, we propose a novel adaptive KD framework, called BGNN, which sequentially transfers knowledge from multiple GNNs into a student GNN. We also introduce an adaptive temperature module and a weight boosting module. These modules guide the student to the appropriate knowledge for effective learning. Extensive experiments have demonstrated the effectiveness of BGNN. In particular, we achieve up to 3.05% improvement for node classification and 6.35% improvement for graph classification over vanilla GNNs.
more » « less
Full Text Available
A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods

https://doi.org/10.18653/v1/2023.eacl-main.66

Zhang, Zhihan; Yu, Wenhao; Yu, Mengxia; Guo, Zhichun; Jiang, Meng (January 2023, EACL)

Full Text Available
Graph-based Molecular Representation Learning

https://doi.org/10.24963/ijcai.2023/744

Guo, Zhichun; Guo, Kehan; Nan, Bozhao; Tian, Yijun; Iyer, Roshni G.; Ma, Yihong; Wiest, Olaf; Zhang, Xiangliang; Wang, Wei; Zhang, Chuxu; et al (August 2023, International Joint Conferences on Artificial Intelligence Organization)

Molecular representation learning (MRL) is a key step to build the connection between machine learning and chemical science. In particular, it encodes molecules as numerical vectors preserving the molecular structures and features, on top of which the downstream tasks (e.g., property prediction) can be performed. Recently, MRL has achieved considerable progress, especially in methods based on deep molecular graph learning. In this survey, we systematically review these graph-based molecular representation techniques, especially the methods incorporating chemical domain knowledge. Specifically, we first introduce the features of 2D and 3D molecular graphs. Then we summarize and categorize MRL methods into three groups based on their input. Furthermore, we discuss some typical chemical applications supported by MRL. To facilitate studies in this fast-developing area, we also list the benchmarks and commonly used datasets in the paper. Finally, we share our thoughts on future research directions.
more » « less
Full Text Available
On the use of real-world datasets for reaction yield prediction

https://doi.org/10.1039/d2sc06041h

Saebi, Mandana; Nan, Bozhao; Herr, John E.; Wahlers, Jessica; Guo, Zhichun; Zurański, Andrzej M.; Kogej, Thierry; Norrby, Per-Ola; Doyle, Abigail G.; Chawla, Nitesh V.; et al (March 2023, Chemical Science)

The lack of publicly available, large, and unbiased datasets is a key bottleneck for the application of machine learning (ML) methods in synthetic chemistry. Data from electronic laboratory notebooks (ELNs) could provide less biased, large datasets, but no such datasets have been made publicly available. The first real-world dataset from the ELNs of a large pharmaceutical company is disclosed and its relationship to high-throughput experimentation (HTE) datasets is described. For chemical yield predictions, a key task in chemical synthesis, an attributed graph neural network (AGNN) performs as well as or better than the best previous models on two HTE datasets for the Suzuki–Miyaura and Buchwald–Hartwig reactions. However, training the AGNN on an ELN dataset does not lead to a predictive model. The implications of using ELN data for training ML-based models are discussed in the context of yield predictions.
more » « less
Full Text Available
SD^2: Slicing and Dicing Scholarly Data for Interactive Evaluation of Academic Performance

https://doi.org/10.1109/TVCG.2022.3163727

Guo, Zhichun; Tao, Jun; Chen, Siming; Chawla, Nitesh V.; Wang, Chaoli (April 2022, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available
Sentence-Permuted Paragraph Generation

https://doi.org/10.18653/v1/2021.emnlp-main.412

Yu, Wenhao; Zhu, Chenguang; Zhao, Tong; Guo, Zhichun; Jiang, Meng (January 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

Generating paragraphs of diverse contents is important in many applications. Existing generation models produce similar contents from homogenized contexts due to the fixed left-to-right sentence order. Our idea is permuting the sentence orders to improve the content diversity of multi-sentence paragraph. We propose a novel framework PermGen whose objective is to maximize the expected log-likelihood of output paragraph distributions with respect to all possible sentence orders. PermGen uses hierarchical positional embedding and designs new procedures for training, and decoding in the sentence-permuted generation. Experiments on three paragraph generation benchmarks demonstrate PermGen generates more diverse outputs with a higher quality than existing models.
more » « less
Full Text Available
Few-Shot Graph Learning for Molecular Property Prediction

https://doi.org/10.1145/3442381.3450112

Guo, Zhichun; Zhang, Chuxu; Yu, Wenhao; Herr, John; Wiest, Olaf; Jiang, Meng; Chawla, Nitesh V. (April 2021, Few-Shot Graph Learning for Molecular Property Prediction)

Full Text Available
Action Sequence Augmentation for Early Graph-based Anomaly Detection

https://doi.org/10.1145/3459637.3482313

Zhao, Tong; Ni, Bo; Yu, Wenhao; Guo, Zhichun; Shah, Neil; Jiang, Meng (January 2021, Proceedings of the 30th ACM International Conference on Information & Knowledge Management)

The proliferation of web platforms has created incentives for online abuse. Many graph-based anomaly detection techniques are proposed to identify the suspicious accounts and behaviors. However, most of them detect the anomalies once the users have performed many such behaviors. Their performance is substantially hindered when the users' observed data is limited at an early stage, which needs to be improved to minimize financial loss. In this work, we propose Eland, a novel framework that uses action sequence augmentation for early anomaly detection. Eland utilizes a sequence predictor to predict next actions of every user and exploits the mutual enhancement between action sequence augmentation and user-action graph anomaly detection. Experiments on three real-world datasets show that Eland improves the performance of a variety of graph-based anomaly detection methods. With Eland, anomaly detection performance at an earlier stage is better than non-augmented methods that need significantly more observed data by up to 15% on the Area under the ROC curve.
more » « less
Full Text Available

« Prev Next »

Search for: All records